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Whole-Genome Shotgun Assembly and Analysis of the Genome of 
Shiraia sp. Strain Slfl4, a Novel Endophytic Fungus Producing 
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Here, we report the draft genome sequence of Shiraia sp. strain Slfl4 (China Center for Type Culture Collection [CCTCC] no. 
209294), which is used to produce huperzine A and hypocrellin A. The genome sequence will allow for the characterization of the 
molecular mechanisms underlying its beneficial properties. 
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Huperzine A (HupA) , a lycopodium alkaloid isolated originally 
from Huperzia serrata, has attracted intense attention since 
its marked anticholinesterase activity was discovered by Chinese 
scientists ( 1 ) . HupA has been marketed in China as a new drug for 
Alzheimer's disease (AD) treatment and is currently used in the 
United States as a supplement for preventing further memory 
degeneration (2). However, the production of HupA from plants 
in large quantities is currently unsustainable because the plant 
resource is very scarce and the content of HupA in plants is ex- 
tremely low (3). Hypocrellin A (HypA), a pigment isolated origi- 
nally from the parasitic fungus Hypocrella bambusae, has a long 
history of use as a traditional medicinal agent to treat rheumatoid 
arthritis, gastric diseases, and skin diseases related to fungal infec- 
tions (4) . Surprisingly, HupA and HypA were recently found to be 
produced by various endophytic fungi, which are much more con- 
trollable than the plants due to simpler genetics and easier manip- 
ulation (3, 5). Shiraia sp. strain Slfl4, which can produce huper- 
zine A and hypocrellin A, is a novel endophytic fungus isolated by 
D. Zhu and his research team from Huperzia serrata in China (6). 
The fungus was collected in the China Center for Type Culture 
Collection (CCTCC) with the number 209294. Here, we report 
the first genome sequencing of Shiraia sp. Slfl4 in an attempt to 
identify the HupA and HypA biosynthetic gene clusters, and se- 
quencing of the genome may afford a basis for subsequent re- 
search that is directly connected to the field of natural product 
synthesis. 

The genome was sequenced using the Illumina Solexa HiSeq 
2000 instrument at the Beijing Genomics Institute (BGI), Shen- 
zhen, China. A library containing 500-bp inserts was constructed. 
Sequencing was performed with the paired-end strategy of 90- and 
90-bp reads to produce 2.0 Gb of filtered sequences, representing 
62.5-fold coverage of the genome. The sequences were assembled 
into 288 contigs (> 1,000 bp) using the SOAP denovo software (7). 
The Shiraia sp. Slfl4 chromosome is about 32 Mbp in length, with 
an average G+C content of 47.95%. Augustus was used as predic- 
tors. A total of 12,513 proteins were compared against the NCBI 



nonredundant (NR) protein databases by BLAST to forecast their 
biological functions (8). 

The genome sequence of Shiraia sp. Slfl4 is a valuable resource 
for identifying the genes involved in the biosynthesis of HupA and 
HypA, and it also serves as a platform for facilitating comparative 
genomics with other Shiraia sp. fungi. A putative HupA biosyn- 
thetic gene cluster is found in the Shiraia sp. Slfl4 genome, and the 
cluster has been deposited in the Genbank database (accession no. 
KF915812). Gene expression analysis and bioassays are needed for 
further investigation into these genes. The genome sequence will 
accelerate the progress of research on Shiraia sp. Slfl4. 

Nucleotide sequence accession numbers. This whole-genome 
shotgun project has been deposited at DDBJ/EMBL/GenBank un- 
der the accession no. AXZN00000000. The version described in 
this paper is version AXZN01000000. 
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